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A Dynamic Boundary Guarding Problem with Translating Targets 

Stephen L. Smith Shaunak D. Bopardikar Francesco Bullo 



Abstract — We introduce a problem in which a service vehicle 
seeks to guard a deadline (boundary) from dynamically arriving 
mobile targets. The environment is a rectangle and the deadline 
is one of its edges. Targets arrive continuously over time on the 
edge opposite the deadline, and move towards the deadline 
at a fixed speed. The goal for the vehicle is to maximize 
the fraction of targets that are captured before reaching the 
deadline. We consider two cases; when the service vehicle is 
faster than the targets, and; when the service vehicle is slower 
than the targets. In the first case we develop a novel vehicle 
policy based on computing longest paths in a directed acyclic 
graph. We give a lower bound on the capture fraction of the 
policy and show that the policy is optimal when the distance 
between the target arrival edge and deadline becomes very 
large. We present numerical results which suggest near optimal 
performance away from this limiting regime. In the second case, 
when the targets are slower than the vehicle, we propose a 
policy based on servicing fractions of the translational minimum 
Hamiltonian path. In the limit of low target speed and high 
arrival rate, the capture fraction of this policy is within a small 
constant factor of the optimal. 



I. Introduction 

Vehicle motion planning in dynamic environments arises 
in many important autonomous vehicle applications. In areas 
such as environmental monitoring, surveillance and perimeter 
defence, the vehicle must re-plan its motion as it acquires in- 
formation on its surroundings. In addition, remote operators 
may add tasks to, or remove tasks from, the vehicle's mission 
in real-time. In this paper we consider a problem in which a 
vehicle must defend a boundary in a dynamic environment 
with approaching targets. 

Static vehicle routing problems consider planning a path 
through a fixed number of locations. Examples include the 
traveling salesperson problem (TSP) [1], the deadline-TSP 
and vehicle routing with time- windows [2]. Recently, re- 
searchers have looked at the TSP with moving objects. In [3] 
the authors consider objects moving on straight lines and 
focus on the case when the objects are slower than the vehicle 
and when the vehicle moves parallel to the x- or ?/-axis. The 
same problem is studied in [4], but with arbitrary vehicle 
motion, and it is called the translational TSP. The authors 
of [4] propose a polynomial-time approximation scheme to 
catch all objects in minimum time. Other variations of the 
problem are studied in [5] and [6]. 

Dynamic vehicle routing (DVR) is a class of problems 
in which vehicles must plan paths through service demand 
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Fig. 1. The problem setup. Demands are shown as black disks approaching 
the deadline at speed v. The service vehicle is a square. 



locations that arrive sequentially over time. An early DVR 
problem was the dynamic traveling repairperson problem [7], 
[8], where each demand assumes a fixed location upon 
arrival, and the vehicle must spend some amount of on- 
site service time at each location. This problem has also 
been studied from the online algorithm perspective [9], [10]. 
Other recent DVR problems include DVR with demands 
that disappear if left unserviced for a certain amount of 
time [11], and demands with different priority levels [12]. 
In our earlier work [13], we introduced a DVR problem 
in which demands arrive on a line segment and move in 
a perpendicular direction at a fixed speed slower than the 
vehicle. We derived conditions on the demand arrival rate and 
demand speed for the existence of a vehicle routing policy 
which can serve all demands, and a proposed a policy based 
on the translational minimum Hamiltonian path. 

Contributions: In this paper we introduce the following 
problem (see Fig. [T]): Targets (or demands) arrive according 
to a stochastic process on a line segment of length W. 
Upon arrival the demands move with fixed speed v towards 
a deadline which is at a distance L from the generator. 
A unit speed service vehicle seeks to capture the demands 
before they reach the deadline (i.e., within L/v time units 
of being generated). The performance metric is the fraction 
of demands that are captured before reaching the deadline. 

We assume that the arrival process is uniform along the 
line segment and temporally Poisson with rate A. In the 
case when the demands are faster than the service vehicle 
(i.e., V > 1) we introduce the novel Longest Path policy, 
which is based on computing longest paths in a directed 
acyclic reachability graph. When L > vW, we derive a 
lower bound on the capture fraction as a function of the 
system parameters. We show that the Longest Path policy is 
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the optimal policy when L is much greater than ^'VK. In the 
case when the demands are slower than the service vehicle 
(i.e, V < 1), we propose a policy based on the translational 
minimum Hamiltonian path called the TMHP-fraction policy. 
In the limit of low demand speed and high arrival rate, the 
capture fraction of this policy is within a small constant 
factor of the optimal. We present numerical simulations 
which verify our results, and show that the Longest Path 
policy performs very near the optimal even when L < vW . 

The paper is organized as follows. In Section |ll| we 
formulate the problem and in Section |IIl] we review some 



background material. In Section IV we consider the case of 
V >1 and introduce the Longest Path Policy. In Section |V| 
we study v < 1 and introduce the TMHP-fraction policy. 



Finally, in Section VI we present simulations results. 



the position of all demands in the region [0, VK] x (— oo, 0] 
at time t as Qunarrived(^)- Then, a non-causal policy is one 

for which p(t) = V(p(t), Q(t) U Qunarrived(t)). 

Problem Statement: The goal in this paper is to find 
causal policies P that maximize the fraction of demands that 
are captured Fcap(P), termed the capture fraction, where 



IFcap(^) := limsupE 



>t(t) 



ncapt(t)+nesc(t) 



III. Preliminary Combinatorial Results 

We now review the longest path problem, the distribution 
of demands in an unserviced region, and optimal tours/paths 
through a set of points. 



II. Problem Formulation 

Consider an environment £ := [0,1^] x [0, L] C as 
shown in Figure [T] The line segment [0, VK] x {0} C E is 
termed the generator, and the segment [0, VK] x {L} C £ 
is termed the deadline. The environment contains a single 
vehicle with position p(t) = [X{t)^Y{t)Y ^ ^, modeled as 
a first-order integrator with unit speed. Demands (or targets) 
arrive in the environment according to a temporal Poisson 
process with rate A > 0. Upon arrival, each demand assumes 
a uniformly distributed location on the generator, and then 
moves with constant speed v > in the positive ^-direction 
towards the deadline. If the vehicle intercepts a demand 
before the demand reaches the deadline, then the demand 
is captured. On the other hand, if the demand reaches the 
deadline before being intercepted by the vehicle, then the 
demand escapes. Thus, to capture a demand, it must be 
intercepted within L/v time units of being generated. 

We let Q{t) d £ denote the set of all outstanding demand 
locations at time t. If the zth demand to arrive is captured, 
then it is removed from Q and placed in the set Qcapt with 
cardinality ncapt- If the i\h demand escapes, then it is removed 
from Q and placed in Qesc with cardinality nesc- 

Causal Policy: A causal feedback control policy for the 
vehicle is a map V :£x ¥{£) where ¥{£) is the set 

of finite subsets of £, assigning a commanded velocity to 
the service vehicle as a function of the current state of the 
system: p(t) = 7^(p(t), Q(t)). 

Non-causal Policy: In a non-causal feedback control 
policy the commanded velocity of the service vehicle is a 
function of the current and future state of the system. Such 
policies are not physically realizable, but they will prove 
useful in the upcoming analysis. 

Formally, let the generation of demands commence at time 
t = 0, and consider the sequence of demands (^i,^2,---) 
arriving at increasing times (^1,^2,...)' with x-coordinates 
(xi,X2,...)- We can also model the arrival process by 
assuming that at time t = 0, all demands are located in 
[O^W] X (—00,0], move in the ^/-direction at speed v for 
alH > 0, and are revealed to the service vehicle when they 
cross the generator. Thus, at time t = 0, the position of the 
ith demand is {xi^v{t — ti)). We can define a set containing 



A. Longest Paths in Directed Acyclic Graphs 

A directed graph G = (V^E) consists of a set of vertices 
V and a set of directed edges E cV xV. An edge (v^w) G 
E is directed from vertex v to vertex w. A path in G is 
a sequence of vertices such that from each vertex in the 
sequence, there is an edge in E directed to the next vertex 
in the sequence. A path is simple if it contains no repeated 
vertices. A cycle is a path in which the first and last vertex 
in the sequence are the same. A graph G is acyclic if it 
contains no cycles. The longest path problem is to find a 
simple path of maximum length (i.e., a path that visits a 
maximum number of vertices). In general this problem is NP- 
hard as its solution would imply a solution to the well known 
Hamiltonian path problem [14]. However, if the graph is a 
DAG, then the longest path problem has an efficient dynamic 
programming solution [1] with complexity 0(|F| + |£^|), that 
relies on topologically sorting [15] the vertices. 



B. Distribution of Demands in an Unserviced Region 

Demands arrive uniformly on the generator, according to a 
Poisson process with rate A. The following lemma describes 
the distribution of demands in an unserviced region. For a 
finite set Q, we let \Q\ denote its cardinality. 

Lemma III.l (Distribution of outstanding demands, [13]) 

Suppose the generation of demands commences at time 
and no demands are serviced in the interval [0,t]. Let Q 
denote the set of all demands in [0, W] x [0, vt] at time 
t. Then, given a measurable compact region IZ of area A 
contained in [0, W] x [0, vt], 

n\n n Q| = n] = where A := XUvW). 

The previous lemma tells us that number of demands N 
in an serviced region of area A is a Poisson distributed 
with parameter \A/{vW). In addition, conditioned on N, 
the demands are independently and uniformly distributed. 
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C. The Euclidean Shortest Path/Tour Problems 

Given a set Q of n points in R^, the EucHdean traveUng 
salesperson problem (ETSP) is to find the minimum-length 
tour (i.e., cycle) of Q. Letting ETSP(Q) denote the minimum 
length of a tour of Q, we can state the following result. 

Theorem III.2 (Length of ETSP tour, [16]) Consider a 
set Q of n points independently and uniformly distributed 
in a compact set £ of area \8\. Then, there exists a constant 
/^TSP such that, with probability one, 

ETSP(Q) 



lim 



/^tspVI^I 



(1) 



The constant /^tsp has been estimated numerically as 
/5tsp « 0.7120 ±0.0002, [17]. 

The Euclidean Minimum Hamiltonian Path (EMHP) prob- 
lem is to compute the shortest path through a set of points. In 
this paper we consider a constrained EMHP problem: Given 
a start point s, a set of n points Q, and a finish point f , all 
in R^, determine the shortest path which starts at s, visits 
each point in Q exactly once, and terminates at f . We let 
EMHP(s, Q, f ) denote the length of the shortest path. 

Corollary III.3 (Length of EMHP) Consider a set Qofn 
points independently and uniformly distributed in a compact 
set 8 of area \£\, and any two points s,f G £. Then with 
probability one, 

EMHP(s,Q,f) 



lim 



•tsp 



where Ptsp is defined in Theorem III. 2 



The above corollary states that the length of the EMHP 
and the ETSP tour are asymptotically equal, and it follows 
directly from the fact that as n ^ +oo, the diameter of S is 
negligible when compared to the length of the tour/path. 

D. Trans lational Minimum Hamiltonian Path (TMHP) 

The TMHP problem is posed as follows. Given initial 
coordinates; s of a start point, Q := {qi, . . . , q^} of a set of 
points, and f of a finish point, all moving with speed v g]0, 1[ 
in the positive ^/-direction, determine a minimum length path 
that starts at time zero from point s, visits all points in the 
set Q and ends at the finish point. The following gives a 
solution [4] for the TMHP problem, 
(i) Define the map g : R^ ^ R^ by 

X y 



-;2 ' 1 



(ii) Compute the EMHP that starts at g^(s), passes through 
{g(qi),---,g(qn)} =: g(Q) and ends at g(f). 

(iii) To reach a translating point with initial position (x, y) 
from the initial position (X, F), move towards the point 

{x^y -\- vT), where 



^ _ V(l - v^)iX - xr + {Y~yf v{Y - y) ^ 
1 — v'^ 1 — v'^ 

The length TMHP^(s, Q, f) of the path is as follows. 



Lemma III.4 (TMHP length, [4]) Let the initial coordi- 
nates s = (xs,^s) cind f = (xf,?/f), and the speed of the 
points V G ]0, 1[. Then, 

TMHP„(s,af) =EMHP(g(s),g(Q),g(f)) + 

IV. Demand Speed Greater Than Vehicle Speed 

Here we develop a policy for the case when the demand 
speed V > 1. In this policy, the service vehicle remains on 
the deadline and services demands as per the longest path in 
a directed acyclic reachability graph. In this section we begin 
by introducing the reachability graph, and then proceed to 
state and analyze the Longest Path policy. 

A. Reachable Demands 

Consider a demand generated at time ti > at position 
(x, 0). The demand moves in the positive ^/-direction at speed 
V > 1, and thus {x{t)^y{t)) = {x^v{t — ti)) for each t G 
[ti, T], where T is either the time of escape (i.e., T = L/v-\- 
ti), or it is the time of capture. Now, given the service vehicle 
location (X(t),F(t)), a demand with position {x^y{t)) is 
reachable if and only if 



v\X{t)-x\<Y{t)-y{t). 



(2) 



That is, the service vehicle must be at a height of at least 
v\X{t) — x\ above the demand in order to capture it. 

Definition IV. 1 (Reachable set) The reachable set from a 
position (X, Y) e 8 is 

R{X, Y) := {{x, y)e£:v\X-x\<\Y- y\}. 

If the service vehicle is located at (X, Y), then a demand 
can be captured if and only if it lies in the set R{X^ Y). 

An example of the reachable set is shown in Figure [2] Next, 
given a demand in the reachable set, the following motion 
gives a method of capture. 

Definition IV.2 (Intercept motion) Consider a vehicle po- 
sition ((X(F),F(F)) and a demand position {x^yit)) G 
R{X{t)^ Y{t)) at time t > 0. In intercept motion, the service 
vehicle captures the demand by first moving horizontally at 
unit speed to the position (x^, Y{t)), and then waiting at the 
location for the demands arrival. 

Lemma IV.3 (Optimality of intercept motion) Consider 
V >1, and let the service vehicle be initially positioned on 
the deadline. Then, there is an optimal policy in which the 
service vehicle uses only intercept motion. 

Proof: Let the service vehicle be positioned at (X, L), 
and consider a demand at (x, ?/) G R{X^L). From equa- 
tion ([2]), we have v\X — x\ < L — y. If v\X — x\ = L — y, 
then deadline motion is the only way in which the demand 
can be captured. Thus, assume that v\X — x\ < L — y, and 
consider two cases; Case 1 in which intercept motion is used, 
and Case 2 in which the demand is captured at a location 
(x, F), where Y < L. 
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Fig. 3. A snapshot in the evolution of the Non-causal Longest Path policy. 
The vehicle has planned the solid red path through all demands, including 
those that have not yet arrived. In comparison, a dashed causal longest path 
is shown, which only considers demands that have arrived. 



Fig. 2. The construction of the reachability graph. The top-left figure 
shows the set of reachable points from a vehicle positioned on the deadline. 
The top-right and bottom-left figures show the reachable set from demand 
locations. The bottom-right figure shows the reachability graph. 



Notice that the position of each outstanding demand 
relative to the service vehicle position at capture is the same 
in Case 1 as in Case 2. Thus, the reachable set in Case 2 
is a strict subset of reachable set in Case 1 and the vehicle 
gains no advantage by moving off of the deadline. ■ 

Next, consider the set of demands in R{X{t)^Y{t)), and 
suppose the vehicle chooses to capture demand i, with 
position qi(F) = {xi^yi{t)) G R{X{t)^Y{t)). Upon capture 
at time T, the service vehicle can recompute the reachable 
set, and select a demand that lies within. Since all demands 
translate together, every demand that was reachable from 
qi(F), is reachable from qi(T). Thus, the service vehicle can 
"look ahead" and compute the demands that will be reachable 
from each captured demand position. This idea motivates the 
concept of a reachability graph. 



literature, such a policy would be referred to as an offline 
algorithm [9]. Figure [3] shows an example of a path generated 
by the Non-causal Longest Path policy. Note that the service 
vehicle will intercept each demand on the deadline, and thus 
the path depicts which demands will be captured, and in what 
order. 

Non-causal Longest Path (NCLP) policy 
Assumes: Vehicle is located on deadline and v > 1. 

1 Compute the reachability graph of the vehicle position 
and all demands in Q(0) U Qunarrived(O). 

2 Compute a longest path in this graph, starting at the 
service vehicle location. 

3 Capture demands in the order they appear on the path, 
intercepting each demand on the deadline. 



Lemma IV.5 (Optimal non-causal policy) If v > 1, then 
the Non-causal Longest Path policy is an optimal non-causal 
policy. Moreover, if v > 1, then for every causal policy P, 



cap 

(P) <Feap(NCLP). 



Definition IV.4 (Reachability graph) For v > 1, the 

reachability graph of a set of points {qi, . . . , q^} G is a 
directed acyclic graph with vertex set V := {1, . . . , n}, and 
edge set E, where for i^j G V, the edge {i^j) is in E if and 
only if q^j G R{cii) and j ^ i. 

Given a set Q of n outstanding demands, and a vehicle 
position (X, F), we can compute the corresponding reach- 
ability graph (see Fig. ^ in O(n^) computation time. In 
addition, by Section |III-A| we can compute the longest path 
in a reachability graph in O(n^) computation time. 

B. A Non-causal Policy and Upper Bound 

To derive an upper bound for v > 1, we begin by 
considering a non-causal policy. In the online algorithms 



Proof: The reachability graph Q(0) U Qunarrived(O) 
contains every possible path that the service vehicle can 
follow. When v > 1 the graph is a directed acyclic graph 
and thus the longest path (i.e., the path which visits the 
most vertices in the graph) is well defined. The vehicle uses 



intercept motion, and thus by Lemma IV. 3 the NCLP policy 
is an optimal non-causal policy, and its capture fraction upper 
bounds every causal policy. ■ 



C. The Longest Path Policy 

We now introduce the Longest Path policy. In the LP 
policy, the fraction is a design parameter. The lower r] 
is chosen, the better the performance of the policy, but this 
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Fig. 4. Scenario (a) and (b) for the proof of Theorem |IV.6| Path (a) visits 
five demands and thus Ca = 5. Path (b) visits four demands, yielding 
m = 4. The demand q2 is the highest on path (b) that can be captured 
from Pa(ti). Thus, n = 1, and 5 = Ca > m — n = 3. 



comes at the expense of increased computation. 



The Longest Path (LP) pohcy 



Assumes: Vehicle is located on deadline and v >1 

1 Compute the reachability graph of the vehicle position 
and all demands in Q(0). 

2 Compute a longest path in this graph, starting at the 
service vehicle location. 

3 Capture demands in the order they appear on the path, 
intercepting each demand on the deadline. 

4 Once a fraction G ]0, 1] of the demands on the path 
have been serviced, recompute the reachability graph of 
all outstanding demands and return to step 2. 

In the following theorem, we relate the Longest Path 
policy to its non-causal relative. Such a bound is referred to 
as a competitive ratio in the online algorithms literature [9]. 

Theorem IV.6 (Optimahty of Longest Path pohcy) If 

V >1, then 

Fcap(LP) > f 1 - Feap(NCLP), 



and thus the LP policy is optimal as vW/L 



-oo. 



Proof: Suppose that the generation of demands begins 
at t = and let us consider two scenarios; (a) the vehicle 
uses the Longest Path policy, and (b) the vehicle uses the 
Non-causal Longest Path policy. Then, at any instant in time 
ti > we can compare the number of demands captured in 
scenario (a) to the number captured in scenario (b) (refer to 
Fig.g. 

Let us consider a time instant ti where in scenario (a), the 
vehicle is recomputing the longest path through all outstand- 
ing demands Q{ti). Let us denote by Pa(^i) and Pb{ti), the 
vehicle position in scenario (a) and scenario (b), respectively, 
at time ti. In scenario (b), let the path that the vehicle will 
take through Q{ti) be given by (qi, q2, • • • , Qm) ^ Qih)- 
The demand qi is reachable from Pb{h), but it may not be 
reachable from Pa(^i). However, a lower bound on the length 
of the longest path in scenario (a) is: (q^+i, qn+2, • • • , qm), 
where qn+i, n G {0, . . . , m — 1}, is the highest demand that 



can be captured from Pa(^i), see Fig. |4] Thus, the length of 
the longest path in scenario (a), Ca, is at least 



Ca ^ Tn — 



(3) 



where m is the length of the path in scenario (b). 

Now, since the deadline has width W, the vehicle in 
scenario (a) can capture any demand (x, y) with y < L—vW. 
Thus, the demands qi, . . . , q^ must all have ^-coordinates in 
]L — vW, L]. Let the total number of outstanding demands at 



time ti be A/tot- Then, conditioned on A/tot, by Lemma III.l 
the expected number of outstanding demands contained in 

[0, W] X ]L - vW, L] is N,otvW/L. Hence, 

vW 

E [n\N,,,] = iVtot^Feap(NCLP). (4) 

Similarly, for the length of the path through Q{t i) in scenario 
(b), we have 

E[m|7V,,,] =7VtotFeap(NCLP). (5) 
Combining equations ^ and ^ with equation ([3| we obtain 

/ vW\ 
E [£,|7Vtot] > A^tot 1 - ^ Feap(NCLP), 



E 



> 



vW 



L J 

Feap(NCLP). 



But Ca/Ntot is the fraction of outstanding demands in Q{ti) 
that will be captured in scenario (a), and it does not depend 
on the value of A/tot- By the law of total expectation 



E 



A^to 



E 



E 



> 1 



vW 



Feap(NCLP). 



Each time the longest path is recomputed, the path in 
scenario (a) will capture at least this fraction of demands. 
Thus, we have Fcap(LP) > E [Ca/Ntot] and have proved the 
result. ■ 



Remark IV.7 (Conservativeness of bound) The bound in 
Theorem |IV.6| is conservative. This is primarily due to 
bounding the expected distance between the causal and non- 
causal paths by W. The distance between two independently 
and uniformly distributed points in [0,VK], is W/3. The 
distance is even less if the points are positively correlated 
(as is likely the case for the distance between paths). Thus, 
it seems that it may be possible to increase the bound to 



Fcap(LP) > ( 1 



vd 



Feap(NCLP), 



where d < W/3. 



□ 



The previous theorem establishes the performance of the 
Longest Path policy relative to a non-causal policy. However, 
the LP policy is difficult analyze directly. This is due to the 
fact that the position of the vehicle at time t depends on 
the positions of all outstanding demands in Q{t). Thus, our 
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approach is to lower bound the capture fraction of the LP 
poHcy with a greedy policy: 

The Greedy Path (GP) policy 
Assumes: Vehicle is located at (X, L) 

1 Compute the reachability set R{X^L). 

2 Capture the demand in R{X^ L) with the highest 
^-coordinate using intercept motion. 

3 Repeat. 

Given a set of outstanding demands Q{t) at time t, the 
Greedy Path policy generates a suboptimal longest path 
through Q(t). In addition, the vehicle position is independent 
of all outstanding demands, except the demand currently 
being captured. Thus, the capture fraction of the Greedy Path 
policy provides a lower bound for the capture fraction of 
the Longest Path policy. We are now able to establish the 
following result. 

Theorem IV.8 (Lower Bound for Longest Path pohcy) 

If L > vW, then for the Longest Path policy 



Deadline 



Fcap(LP)>Feap(GP)> 

where a = \W/2 and erf : 
function. 



1 



7raerf(Y^) + e " ' 

[—1, 1] is the error 



Proof: We begin by looking at the expression for the 
capture fraction. Notice that if ncapt(^) > for some t > 0, 
then 



lim sup E 



>t(t) 



(^)+nesc(^) 



lim sup E 



1+ 



^esc(*) 
''^capt (*) 



> 1 



lim sup E 

t^+oo 



(6) 

where the last step comes from an application of Jensen's 
inequality [18]. Thus, we can determine a lower bound on 
the capture fraction by studying the number of demands that 
escape per captured demand. 

Let us study the time instant t at which the service vehicle 
captures its ith demand, and determine an upper bound on 
the number of demands that escape before the service vehicle 
captures its (i + l)th demand. Since we seek a lower bound 
on the capture fraction of the LP policy, we may consider 
the path generated by the Greedy Path policy. In addition, 
we consider the worst-case service vehicle position; namely, 
the position (0, L) (or equivalently (VK, L)). 

From the position (0, L), the reachable set is 

i?(0,L) = {{x,y) e£ :vx<L}. 

Let Ry denote the reachable set intersected with [0, VF] x 
[L — y^L], where y G [0,L], and let \Ry\ denote its area. 
Then, 

if y<vW, 
\yW-^, if y>vW. 

An illustration of the set Ry is shown in Figure |5] Let yd 
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Fig. 5. The setup for the proof of Theorem |IV.8| The service vehicle is 
located at (0, L). All demands in the region escy^ escape while capturing 
the demand with the highest y-coordinate. 



be the ^/-distance to the reachable demand with the highest 
^/-coordinate. That is, 

yd — min \L — y}, 

ix,y)eQit)nRio,L)^ 

where Q{t) is the set of outstanding demands at time t. By 
Lemma |in.l[ the probability that a subset B C £ with area 
\B\ contains zero demands is given by 

P[|SnQ(t)| =0] =e-^l^l/^^^\ 

where \B H Q{t)\ denotes the cardinality of the finite set 
BnQ{t). Thus, 

nvd >y]= n\Ry n Q{t)\ = o] = e-^i^-i/^^^). 

The probability density function of yd for yd ^ vW is 



^ = Ty' 
A 



-Xy'^ /{2v'^W) 



ye 



Now, given yd, all demands residing in the region esc^^ := 
([0, VI^] X [L - yd,L\) \ Ry^ will escape (see Fig. [5]). The 



area of esc^y^ is 



esc,. 



2 



if yd < vW, 
if yd > vW. 

From Section |in-B| the expected number of outstanding 
demands in an unserviced region of area A is XA/{vW). 
Thus, given that the vehicle is located at (0, L), the expected 
number of demands that escape while the service vehicle is 
capturing its (i + l)th demand is given by 

^E[|esc,J] 



E 



A 



vW 

vW 



yW 



r 

2v 



f{y)dy 



> vW] 



Applying the probability density function and cumulative 
distribution function of yd we obtain 



E[ne: 



A^ 



Jo V 



yW 



r 

2v 



ye 
XW 



Xy^/(2v^W)^y 
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To evaluate the integral, consider the change of coordinates 
z := yjvW , and define a := AVK/2. After simplifying, the 
integral becomes 



dz. 



(8) 



Integrating by parts we obtain 

V7raerf(vc^) + ae~" + — 1, 
where erf : R ^ ["l^ 1] is the error function: 

erf(x) = - / e-^'^^t. 
^ Jo 

Substituting equation ^ into equation ^ we obtain 

E [nesc,i] = V^erf(vc^) + e""^ - 1. 



Since E [nesc,i] is computed for the worst-case vehicle posi- 
tion (0, L), and since this expression holds at every capture, 
we have that 



lim sup E 



nesc(t) 
''^capt(^) 



< V^erf(vc^) +e-^ - 1, 
and thus by equation ^ we obtain the desired result. 



V. Demand speed less than vehicle speed 

In this section we study the case when the demand speed 
V < 1. For this case, an upper bound on the capture fraction 
has been derived in [13]. We introduce a policy which is a 
variant of the TMHP-based policy in [13], and lower bound 
its capture fraction in the limit of low demand speed and 
high demand arrival rate. 



A. Capture Fraction Upper Bound 

The following theorem upper bounds the capture fraction 
of every policy for the case of < 1. 





Fig. 6. The TMHP-fraction policy. The left-hand figure shows a TMHP 
through all outstanding demands. The right-figure shows the instant when 
the vehicle has followed the path for L/(2'u) time units and recomputes its 
path, allowing some demands to escape. 



following policy utilizes this path to service demands. 



The TMHP-fraction (TF) policy 



Assumes: Vehicle is located on the line y = L/2. 

1 Compute a translational minimum Hamiltonian path 
through all outstanding demands in [O^W] x [0,L/2], 
starting at the service vehicle position, and terminating 
at the demand with the lowest ^-coordinate. 

2 if time to travel entire path is less than L/(2v) then 

3 Service all outstanding demands by following the 
computed path. 

4 else 

5 Service outstanding demands along the computed 
path for L/(2v) time units. 

6 Repeat. 

Figure [6] shows an example of the TMHP-fraction policy. 
In contrast with the LP policy, where the vehicle remains on 
the deadline, in the TMHP-fraction policy the vehicle follows 
the TMHP using minimum time motion between demands as 
described in Section IIII-DI Notice that none of the demands 
in the region [0,1^] x [0,L/2] at time t will have escaped 
before time t + L/{2v). Thus, the vehicle is guaranteed that 
for the first L/(2v) time units, all demands on the TMHP 
path are still in the environment. For the TMHP-fraction 
policy we have the following result. 



Theorem V.l (Capture fraction upper bound, [13]) If 

V <1, then for every causal policy P 

The proof of the above theorem is contained in [13], and 
relies on a computation of the expected minimum distance 
between demands. Notice that for low demands speed, i.e., 
^ ^ 1, it may be possible to achieve a capture fraction of 
one, even for high arrival rates. 

B. The TMHP-fraction Policy 

In Section IIII-DI we reviewed the translational minimum 
Hamiltonian Path (TMHP) through a set of demands. The 



Theorem V.2 (TMHP-fraction policy lower bound) In 

the limit as v ^ 0^ and A +oo, the capture fraction of 
the TMHP-fraction policy satisfies 

Proof: Consider the beginning of an iteration of the 
policy, and assume that the duration of the previous iteration 
was L/{2v). In this case, the vehicle has ^/-coordinate Y G 
[L/2,L], and by Lemma [iiLl] the region 7^ := [0,W] x 
[0,1/ /2] contains a number of demands that is Poisson 
distributed with parameter XL/{2v). Conditioned on N, the 
demands are independently and uniformly distributed in IZ. 

Now, we make use of the following three facts. First, as 
V ^ 0+, the length of the TMHP constrained to start at the 
vehicle location and end at the lowest demand, is equal to 
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the length of the EMHP in the corresponding static instance, 



as described in Lemma III.4 Second, from Corollary III. 3 



for uniformly distributed points, the asymptotic length of a 
constrained EMHP is equal to the asymptotic length of the 
ETSP tour. Third, as v 0+, and A +00, we have 
that N tends to +00 with probability one. Using the above 
facts we obtain that the length of the TMHP starting at 
the vehicle position, passing through all demands in 71, and 
terminating at the demand with the lowest ^-coordinate, has 
length in the limiting regime as v ^ 0+, 

and A +00. 

The vehicle will follow the TMHP for at most L/{2v) 
time units, and thus will service cN demands, where 



min < 1, 



pTSpvV2NW 



Now, the random variable N has expected value E [N] — 
\L/(2v) and variance = XL/{2v). By the Chebyshev 
inequality, P[|A/' - E [AT] | > a] < cr^/a^, and thus letting 
a = ^/vE [N] , we have 



F[N > (1 + V^)E [N]] < 



vE [N] 



2 

XL' 



Thus, we have 



c > min < 1, 



/5tsp\/(1 + V^)^AW 



with probability at least 1 
with probability 1, 



-2 /{XL). In the limit as A ^ +00, 



c > min 



(9) 



Therefore, if the previous iteration had duration at least 
L/{2v), then the total fraction of demands captured in the 
current iteration is given by equation 

The other case is that the previous iteration had duration 
T < L/{2v). In this case, all outstanding demands in the 
region 7^ := [0, VI^] x [0, L/2] lie in a subset [0, VI^] x [0, vT], 
and the subset contains a number of demands N that is 
Poisson distributed with parameter AT < XL/(2v). Thus, 
in this case there are fewer outstanding demands, and the 



bound on c still holds. Thus, 
the desired result. 



Fcap(TF) > c, and we obtain 



Remark V.3 (Bound comparison) In the limit as v ^ 0+, 
and A +00, the capture fraction of the TMHP-fraction 
policy is within a factor of 2/3tsp ~ 1-42 of the optimal. □ 

VI. Simulations 

We now present two sets of results from numerical ex- 
periments. The first set compares the Longest Path policy 
with 7^ = 1 to the Non-causal Longest Path policy and to 
the theoretical lower bound in Theorem IIV. 8 1 The second set 
compares the TMHP-fraction policy to the policy indepen- 
dent upper bound in Theorem |V.1| and the lower bound in 
Theorem |y2 



— LP policy experimental 
---Noncausal LP experimental 

— Lower bound 




0.9 

0.8 

c 0.7 
o 

0-6 

0.5 
i.0.4 



-Longest Patli 
-Fundamental limit 



(a) 



0.04 0.06 

Arrival rate 
: 2 and L > vW . 



0.04 0.06 0.08 

Arrival rate 



(b) i; = 5 and L < vW. 



Fig. 7. Simulation results for LP policy (solid red line with error bars 
showing ± one standard deviation) and the NCLP policy (dashed black 
line) for an environment of width W = 12 and length L = 500. In (a), 
L > vW, and the lower bound in Theorem |lV. 8 1 is shown in solid green. 





(a) Demand speed v = 0.01. 



Arrival rate 
(b) Demand speed v = 0.05. 



Fig. 8. Simulation results for TM HP-fr action policy. The solid black curve 
shows the upper bound in Theorem [V.l| and the dashed line shows the lower 
bound in Theorem I V. 2 1 Numerical results are shown with error bars. 



To simulate the LP and the NCLP policies, we perform 10 
runs of the policy, where each run consists of 5000 demands. 
A comparison of the capture fractions for the two policies is 
presented in Figure [7] When L > vW, the capture fraction of 
the LP policy is nearly identical to that of the NCLP policy. 
Even in Figure |7(a)[ where L < vW, the capture fraction 
of the LP policy is within 2% of the NCLP policy, and thus 
the optimal. This suggests that the Longest Path policy is 
essentially optimal over a large range of parameter values. 

To simulate the TMHP-fraction policy, the linkerrj^ 
solver is used to generate approximations to the optimal 
TMHP. For each value of arrival rate, we determine the 
capture fraction by taking the mean over 10 runs of the 
policy. A comparison of the simulation results with the 
theoretical results from Section |V| are presented in Figure [5] 
For V = 0.01 in Fig. |8(a)[ the experimental results are in 
near exact agreement with the theoretical lower bound in 
Theorem V.l For v = 0.05 in Fig. 8(b) the experimental re- 
sults are within 5% of the theoretical lower bound. However, 
notice that the experimental capture fraction is smaller than 
the theoretical lower bound. This is due to several factors. 
First, we have not reached the limit as v ^ 0+ and A ^ +00 
where the asymptotic value of /^tsp ^0.712 holds. Second, 



we are using an approximate solution to the optimal TMHP, 

^ linkern is freely available for academic research use at 
|http : //www . tsp . gatech . edu/concorde . html| 
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generated via the linkern solver. 

VII. Conclusions 

In this paper we introduced a pursuit problem in which a 
vehicle must guard a deadline from approaching demands. 
We presented novel policies in the case when the demand 
speed is greater than the vehicle speed, and in the case when 
the demand speed is less than the vehicle speed. In the former 
case we introduced the Longest Path policy which is based on 
computing longest paths in the directed acyclic reachability 
graph, and in the latter case we introduced the TMHP- 
fraction policy. For each policy, we analyzed the fraction 
of demands that are captured. 

There are many areas for future work. The Longest Path 
policy has promising extensions to the case when demands 
have different priority levels, and to the case of multiple 
vehicles. We would also like to fully characterize the capture 
fraction when L < vW, and tighten our existing bounds to 
reflect the near optimal performance shown in simulation. 
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